Retrieval of Web Documents Using a Fuzzy Hierarchical Clustering
نویسندگان
چکیده
منابع مشابه
Retrieval of Web Documents Using a Fuzzy Hierarchical Clustering
The World Wide Web has huge amount of information that is retrieved using information retrieval tool like Search Engine. Page repository of Search Engine contains the web documents downloaded by the crawler. This repository contains variety of web documents from different domains. In this paper, a technique called “Retrieval of Web documents using a fuzzy hierarchical clustering” is being propo...
متن کاملA Novel Indexing Technique for Web Documents using Hierarchical Clustering
The information on the WWW is growing at an exponential rate; therefore, search engines are required to index the downloaded Web documents more efficiently. Web mining techniques like clustering can be used for this purpose. In this paper, a novel technique to index the documents is being proposed that not only indexes the documents more efficiently but also uses hierarchical clustering to keep...
متن کاملClassification of Web Documents using Fuzzy Logic Categorical Data Clustering
We propose a categorical data fuzzy clustering algorithm to classify web documents. We extract a number of words for each thematic area (category) and then, we treat each word as a multidimensional categorical data vector. For each category, we use the algorithm to partition the available words into a number of clusters, where the center of each cluster corresponds to a word. To calculate the d...
متن کاملAutomatic thematic categorization of documents using a fuzzy taxonomy and fuzzy hierarchical clustering
In this paper we formally define the problem of automatic detection of thematic categories in a semantically indexed document, and identify the main obstacles to overcome in this process. Furthermore, we explain how detection of thematic categories can be achieved, with the use of a fuzzy quasi-taxonomic relation. Our approach relies on a fuzzy hierarchical clustering algorithm; this algorithm ...
متن کاملHierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics
This paper discusses about the future of the World Wide Web development, called Semantic Web. Undoubtedly, Web service is one of the most important services on the Internet, which has had the greatest impact on the generalization of the Internet in human societies. Internet penetration has been an effective factor in growth of the volume of information on the Web. The massive growth of informat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2010
ISSN: 0975-8887
DOI: 10.5120/921-1299